Detecting and accounting for multiple sources of positional variance in peak list registration analysis and spin system grouping

Authors

  • Andrey Smelter
  • Eric C Rouchka
  • Hunter N B Moseley
Abstract

Peak lists derived from nuclear magnetic resonance (NMR) spectra are commonly used as input data for a variety of computer-assisted and automated analyses, including automated protein resonance assignment and protein structure calculation software tools. Prior to these analyses, peak lists must be aligned to each other, and sets of related peaks must be grouped based on common chemical shift dimensions. Even when programs can perform peak grouping, they require the user to provide uniform match tolerances or else use default values. However, peak grouping is further complicated by multiple sources of variance in peak position, which limit the effectiveness of grouping methods that rely on uniform match tolerances. In addition, no method currently exists for deriving peak positional variances from single peak lists for grouping peaks into spin systems, i.e. spin system grouping within a single peak list. Therefore, we developed a complementary pair of peak list registration analysis and spin system grouping algorithms designed to overcome these limitations. We have implemented these algorithms in an approach that can identify multiple dimension-specific positional variances that exist in a single peak list and group peaks from a single peak list into spin systems. The resulting software tools generate a variety of useful statistics on both a single peak list and pairwise peak list alignment, especially for quality assessment of peak list datasets. We used a range of low and high quality experimental solution NMR and solid-state NMR peak lists to assess the performance of our registration analysis and grouping algorithms. Analyses show that an algorithm using a single iteration and uniform match tolerances recovers only 50 to 80% of the spin systems because of the presence of multiple sources of variance. Our algorithm recovers additional spin systems by reevaluating match tolerances over multiple iterations.
To facilitate evaluation of the algorithms, we developed a peak list simulator within our nmrstarlib package that generates user-defined assigned peak lists from a given BMRB entry or database of entries. In addition, over 100,000 simulated peak lists with one or two sources of variance were generated to evaluate the performance and robustness of these new registration analysis and peak grouping algorithms.
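
The contrast between a single pass with uniform match tolerances and repeated reevaluation of tolerances can be illustrated with a small sketch. This is not the authors' algorithm and does not use nmrstarlib: the greedy grouping rule, the function names (`group_peaks`, `estimate_tolerances`, `iterative_grouping`), the `scale` factor of 3 standard deviations, and the toy (1H, 15N) peak positions are all assumptions made for illustration only.

```python
from statistics import mean, pstdev


def group_peaks(peaks, tolerances):
    """Greedy single-pass grouping (illustrative assumption): a peak joins the
    first group whose centre lies within tolerance in every dimension,
    otherwise it starts a new group."""
    groups = []
    for peak in peaks:
        for group in groups:
            centre = [mean(dim) for dim in zip(*group)]
            if all(abs(p - c) <= t for p, c, t in zip(peak, centre, tolerances)):
                group.append(peak)
                break
        else:
            groups.append([peak])
    return groups


def estimate_tolerances(groups, previous, scale=3.0):
    """Re-estimate one tolerance per dimension as scale * standard deviation
    of peak deviations from their group centres. Dimensions without any
    multi-peak group keep their previous tolerance."""
    tolerances = []
    for d, old in enumerate(previous):
        deviations = []
        for group in groups:
            if len(group) < 2:
                continue
            centre = mean(p[d] for p in group)
            deviations.extend(p[d] - centre for p in group)
        tolerances.append(scale * pstdev(deviations) if deviations else old)
    return tuple(tolerances)


def iterative_grouping(peaks, initial_tolerances, max_iter=10):
    """Alternate grouping and tolerance re-estimation until the
    dimension-specific tolerances stop changing (or max_iter is reached)."""
    tolerances = tuple(initial_tolerances)
    groups = group_peaks(peaks, tolerances)
    for _ in range(max_iter):
        updated = estimate_tolerances(groups, tolerances)
        if all(abs(a - b) < 1e-9 for a, b in zip(updated, tolerances)):
            break
        tolerances = updated
        groups = group_peaks(peaks, tolerances)
    return groups, tolerances


if __name__ == "__main__":
    # Toy peak positions in ppm (1H, 15N); values are made up.
    toy_peaks = [(8.20, 120.1), (8.21, 120.3), (8.19, 119.9),
                 (7.50, 115.0), (7.51, 115.2)]
    print(len(group_peaks(toy_peaks, (0.02, 0.15))), "groups, single pass")
    groups, tolerances = iterative_grouping(toy_peaks, (0.05, 0.5))
    print(len(groups), "groups, iterative; tolerances:", tolerances)
```

In this toy case the 15N dimension varies far more than the 1H dimension, so a single tight tolerance pair fragments the spin systems, while per-dimension tolerances estimated from the data keep them together. A real implementation would also need to handle overlapped spin systems and peaks with missing dimensions, which this sketch ignores.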

Journal title:

Volume 68  Issue

Pages  -

Publication date 2017